The complexity of computing the MCD-estimator
نویسندگان
چکیده
In modern statistics the robust estimation of parameters is a central problem, i. e., an estimation that is not or only slightly affected by outliers in the data. The Minimum Covariance Determinant estimator (MCD) [8] is probably one of the most important robust estimators of location and scatter. The complexity of computing the MCD, however, was unknown and generally thought to be exponential even if the dimensionality of the data is fixed. Here we present a polynomial time algorithm for MCD for fixed dimension of the data. In contrast we show that computing the MCDestimator is NP-hard if the dimension varies.
منابع مشابه
Nonsingular Robust Covariance Estimation in Multivariate Outlier Detection
Rousseeuw’s minimum covariance determinant (MCD) method is a highly robust estimator of multivariate mean and covariance. In practice, the MCD covariance estimator may be singular. However, a nonsingular covariance estimator is required to calculate the Mahalanobis distance. In order to fix this singular problem, we propose an improved version of the MCD estimator, which is a combination of the...
متن کاملRelaxMCD: Smooth optimisation for the Minimum Covariance Determinant estimator
The Minimum Covariance Determinant (MCD) estimator is a highly robust procedure for estimating the centre and shape of a high dimensional data set. It consists of determining a subsample of h points out of nwhichminimises the generalised variance. By definition, the computation of this estimator gives rise to a combinatorial optimisation problem, forwhich several approximate algorithms have bee...
متن کاملA Fast Algorithm for the Minimum Covariance Determinant Estimator
The minimum covariance determinant (MCD) method of Rousseeuw (1984) is a highly robust estimator of multivariate location and scatter. Its objective is to nd h observations (out of n) whose covariance matrix has the lowest determinant. Until now applications of the MCD were hampered by the computation time of existing algorithms, which were limited to a few hundred objects in a few dimensions. ...
متن کاملGeneralized Ridge Regression Estimator in Semiparametric Regression Models
In the context of ridge regression, the estimation of ridge (shrinkage) parameter plays an important role in analyzing data. Many efforts have been put to develop skills and methods of computing shrinkage estimators for different full-parametric ridge regression approaches, using eigenvalues. However, the estimation of shrinkage parameter is neglected for semiparametric regression models. The m...
متن کاملStudy Concentrations Changes of Lead, Chromium, Nickel, Cadmium, Zinc and Copper in Beshar River of Kohgiluyeh and Boyerahmad Province, Iran
Due to the lack of water quality of resourses in country, exact review of available water resources quality, especially in terms of heavy elements (heavy metals) has a particular importance. Due to industrial towns 1 and 2 of Yasuj city and also Yasuj wastewater treatment, surveying environmental harmful effects of heavy metals on this river is very important. To investigate the changes of heav...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Theor. Comput. Sci.
دوره 326 شماره
صفحات -
تاریخ انتشار 2004